Media delivery platform

ABSTRACT

A method of delivering an audio and/or visual media file including, for example, one or more of full or partial master recordings of songs, musical compositions, ringtones, videos, films, television shows, personal recordings, animation and combinations thereof, over the air wirelessly, from one or more servers to an electronic device with or without an Internet connection, said method comprising transmitting and audio and/or visual media file in compressed format to said electronic device, and wherein the electronic device is effective to receive said audio and/or visual file and playback said audio and/or visual content on demand by a user.

RELATED APPLICATION

This application is a continuation of application Ser. No. 12/322,615filed Feb. 4, 2009, which in turn is a continuation of application Ser.No. 10/183,756 filed Jun. 26, 2002, now U.S. Pat. No. 7,548,875 issuedJun. 16, 2009, which claims the benefit of U.S. Provisional ApplicationNo. 60/301,681 filed on Jun. 27, 2001, U.S. Provisional Application Ser.No. 60/303,115 filed on Jul. 3, 2001, U.S. Provisional Application Ser.No. 60/312,450 filed on Aug. 14, 2001, and U.S. Provisional ApplicationSer. No. 60/343,159 filed Oct. 26, 2001, each of which is hereby fullyincorporated herein by reference.

TECHNICAL FIELD

This invention relates to a method of delivery and play back of soundand image files for wireless and non-wireless electronic devices.

BACKGROUND ART

The general concept for delivery of sound recordings or clips and visualrecordings or clips by way of the Internet is known and described invarious U.S. patent applications. (See Bernard et al., U.S. Pat. No.5,918,213; Kaplan, U.S. Pat. No. 5,963,916; Barbara, U.S. Pat. No.5,926,789; and Doerr et al., U.S. Pat. No. 5,949,411.) Such methods aretypically used to sell products to consumers. For example, a web pagefrom Amazon.com allows a user to listen to samples of music for beforepurchasing compact discs (CD's) by mail.

Also, cell phones may be programmed to ring with a tune of a song ormusical composition, and have become increasingly popular. However,cellular phones currently on the market can only be either programmed toonly play music (such as conventional MP3 type phones) or to deliver“ring tones” with an electronic chime or ring tone rather than an actualrecorded song, human voice, or musical composition. Additionally, thesering tones must be factory installed in the telephone or the deliverymethods just directly interface with the Internet and require theconsumer to be on line to access and download a particular mechanicalring tone.

At the same time, various methods have been developed and are being usedto enable a phone user to make more effective use of the variety oftelephone service now available. For example, “caller ID” function isone such feature which allows the recipient of an incoming call toidentify the caller based on textual information provided on a telephonedisplay panel. A patent to Borland, U.S. Pat. No. 6,178,230 discloses animproved telephone system and method that determines the identity of theperson being called for a telephone having more than one user and canidentify the person being called by sounding a distinctive ringassociated with the person being called. A mechanical ring tone isplayed depending upon the caller ID signal received to orally alert thetelephone user as to who is calling without reading the telephone'sdisplay panel.

DISCLOSURE OF INVENTION

The present invention provides an improved method for delivery and playback of sound and image files which include songs, musical compositions,and other sound recordings cartoons, movies, television shows, or anyother type of performance, which may be copyright registered, as well asnon copyright registered personal recordings (e.g., personal soundrecordings, family photos, home movies, etc.). This new method includesthe use of sound and/or image clips, which can be snippets or fullfiles, as alerts for a variety of electronic devices or for playing on ahandheld device. A collection or library of uniquely selected and/oredited clips may also be provided to the consumer in a manner far moreconveniently on conventional telephone equipment than previouslyavailable.

The method provides the consumer with a unique way of accessing andbrowsing through selectable files which may be Internet based orindependent of the Internet. Additionally, the unique delivery methodprovides a seller or service provider with a convenient and moreefficient way of promoting and selling entire sound and image fileswhich include downloadable music, movies, films, shows, and items suchas records, cassette tapes, CDs, videos, and DVDs.

Algorithms are provided for the delivery, storage and playback of thesound files, including a delivery method algorithm, a parametricoptimization and compression algorithm, and an error correctionalgorithm.

According to one embodiment, sound files are accessed by a cellular orlandline telephone for allowing the consumer to browse, download, hearand/or purchase sound files or use sound files including sound clips asringer sounds. In contrast to the conventional ring tones or musicalchimes used to ring cellular phones currently on the market, the currentinvention provides a method for ringing cellular phones (both analogueand digital) and landline telephones with real sound recordingsincluding real music, which may be songs lifted from copyrightregistered CD tracks, and may comprise human voice, various instrumentsounds, and other sound effects of a high quality. Instead of simplytones being played the higher fidelity musical composition can be playedby the telephone or other handset with a degree of fidelity previouslyunavailable using conventional methods. The high degree of fidelity isachieved using data compression, error correction and parametricoptimization algorithms adaptable to conventional telephones and otherhandheld devices.

A software based system for encoding the hardware of existing cellularphones at the time of manufacturing with delivery, storage, and playbackcapabilities in accordance with the present invention is provided, suchthat additional hardware is not required. (Only a suitable speaker needbe required with most telephones already possessing the necessaryquality of speaker.) The ability to provide this technology without theneed for extra hardware is very significant, particularly to thecellular phone industry, as it is especially desirable to make cellularphones as lightweight and as small as possible and at the lowest cost.

An accessory attachment to standard telephones can however beincorporated to implement the delivery, storage, and playbackcapabilities of the present invention to existing landline and cellulartelephones which have not been encoded at the time of their manufacture,if necessary. Such accessory attachments are compatible with existingtelephones, and may be sold separately. Also, a micro chip may beembedded in landline telephones for providing the telephone withbrowsing, delivery, storage, and playback capabilities of the presentinvention.

The accessory attachment or telephone encoded with software and/orincluding hardware for providing delivery, storage, and playbackcapabilities as described herein, may be manufactured with embeddedsound files including sound clips, such that a user can immediately playback the files, including use the files as ringer sounds, without havingto first download any files.

Additionally, upon hearing a sound clip on the telephone, a user maychoose to download the entire unedited sound file for a fee or purchasean item associated with the sound clip (e.g., record, cassette tape, CD,video, or DVD) by pressing a designated button on the accessoryattachment or keys on the telephone keypad. As such, sound clips whichhave been downloaded to, or preprogrammed on a cell phone, may encourageand stimulate the sale of downloadable files and/or items associatedwith the clips by allowing the user to make an impulsive purchaseimmediately upon hearing the clips.

The accessing of sound and/or image files by other electronic devices,such as home phones, computers, pagers, doorbells, alarms, palm pilots,watches, clocks, PDAs etc., for either allowing the consumer to browse,download, hear, view, and/or purchase sound recordings, image files, orassociated items, or to use sound and/or image clips as alerts is alsopart of the invention and not limited to solely telephones. Newelectronic devices, whose independent purpose is to allow the user tobrowse, receive, store and play sound and image files, including clips,according to the present invention are also described.

A security feature may be included on such electronic devices adaptedfor allowing a consumer to access and use sound and image filesaccording to the present invention. This feature is designed to preventintellectual property abuse by consumers' unauthorized dissemination andreproduction of copyright protected material. The downloaded chips orrecordings are coded and cannot be downloadable or transferred to unitsother than the consumers' pre-selected layer or telephone.

Also, a tracking feature for keeping a record of every song downloadedand/or each time a song is played can be incorporated for providingperforming rights organizations or songwriters' organizations with anaccurate method for determining royalty payments to writers andperformers of music.

Additionally, a website suitable for viewing and selecting downloadingsound and/or image clips or entire files may be used for giving theconsumer and music or image seller a unique way of transacting the saleof such files or other associated items such as records, cassette tapes,CD's, videos, or DVD's. The website may allow the clips to be stored ona user's computer, providing the user the ability to readily access theclips for downloading the clips to an electronic devise, using the clipsas computer alerts, or playing the clips on the computer. The user mayalso purchase files or items associated with the clips through thecomputer and/or website.

The system of the present invention may also allow the consumer tobrowse through hundreds or thousands of sound and/or image clips and/orfiles for the purpose of downloading to electronic devices with anoption to purchase an associated record, cassette tape, CD, video orDVD, or download the full unclipped sound and/or image file.

Furthermore, the delivery of files including clips is not limited to webbased applications. Unlike conventional methods which require computerplug-in devices for delivering and transferring digital music, thecurrent invention may use a delivery method which allows the user tobrowse, download, and listen to or watch sound or image files withoutthe need for hand wired plug-in devices or a computer connection to theInternet.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a general schematic diagram illustrating the basic componentsof a wireless transmission system for a landline or cellular telephone.

FIG. 2 is a schematic diagram of a wireless transmission system for acellular phone.

FIG. 3 is a schematic diagram of a board system implemented in anaccessory unit of the system of FIG. 2.

FIG. 4 is a schematic diagram of a server software system for the systemof FIG. 2.

FIG. 5 is a flow chart illustrating a data transmission method.

FIGS. 6 A-D illustrate the electrical schematics of a mobile telephoneaccessory unit in accordance with the present invention.

FIG. 7 illustrates an image of a printed circuit board for the accessoryunit of FIGS. 6 A-D.

FIG. 8 is an exploded side view illustrating the assembly of a cellularphone accessory unit and cellular phone attachment to the accessoryunit.

FIG. 9 is a perspective view of the cellular phone accessory unit andcellular phone of FIG. 8, showing the cellular phone attached to theback of the phone connector and mounting of the accessory unit.

FIG. 10 is a perspective view of the accessory unit of FIG. 9, detachedfrom the telephone.

FIG. 11 is a picture of the accessory unit of FIG. 9, disassembled fromthe mounting.

FIG. 12 is a schematic diagram of a landline transmission system for ahome telephone.

FIG. 13 is a schematic diagram of a board system implemented in anaccessory unit of a home telephone utilizing the system of FIG. 12.

FIG. 14 is a schematic diagram of server software for the system of FIG.12.

FIG. 15 is a flow chart for an audio data parametric optimization andcompression algorithm.

FIG. 16 is a schematic diagram of a protocol for a data transmissionmethod with error correction delivery for a digital cellular telephone,illustrating individual packet acknowledgement for a full-duplex channelcase.

FIG. 17 is a schematic diagram of a protocol for a data transmissionmethod with error correction delivery for a digital cellular telephone,illustrating single acknowledgement for all packets for a half-duplexchannel case.

FIG. 18 is a schematic diagram for a media file monitoring system.

MODE(S) FOR CARRYING OUT ME INVENTION

The detailed description set forth below in connection with the appendeddrawings is intended as a description of presently-preferred embodimentsof the invention and is not intended to represent the only forms inwhich the present invention may be constructed and/or utilized. Thedescription sets forth the functions and the sequence of steps forconstructing and operating the invention in connection with theillustrated embodiments. However, it is to be understood that the sameor equivalent functions and sequences may be accomplished by differentembodiments that are also intended to be encompassed within the spiritand scope of the invention.

The present invention uses a unique method for delivery, storage, andplay back of sound and image files which include songs, musicalcompositions, or other sound recordings, cartoons, movies, televisionshows, or any other type of performance, as well as personal clips(e.g., personal sound recordings, family photos, home movies, etc.).This method includes the use of sound and or image clips as alerts for avariety of electronic equipment, and provides the consumer with a uniqueway of accessing these files which may be Internet based or independentof the Internet.

The present invention may include a number of modules for an overallsystem of delivery of music and audio/visual files. These modulesinclude a server of the files accessible by way of a specialized websitefor viewing, selecting, sampling and downloading selected files orportions thereof or directly accessible without going through a website.A telephone, be it conventional, cell phone or other hand held devicewith access to a communication network can access the server eitherdirectly or through the website. Special algorithms allow the transferof the files to the handset providing the high gravity recording in afile formal which allows for tracking and security against unauthorizedreproduction. The individual elements of the invention are unique aswell as the overall system of delivery tracking and security. Describedbelow are more detail aspects of the invention and its use.

Use of Sound and/or Image Clips as Alerts for Electronic Devices

According to one embodiment the system allows for sound and/or imageclips which are snippets of a musical and/or visual performance piece tobe used for sound and/or image alerts in electronic devices. A libraryor collection of uniquely edited clips may be provided to the consumerfor browsing and selecting files to be stored on the electronic device.The consumer may also use home made personal clips (e.g. personal soundrecordings, family photos, home movies, etc.).

The sound and/or image clips may be lifted from CD's, movies, TV shows,and the like, and are actual recordings, which may include human voice,instrument sound, and other sound effects, rather than mere electronicchimes or tones as those produced by conventional cellular phones.Electronic devices which may utilize sound and/or image clips as alertsinclude, but are not limited to cellular phones, land line phones,computers, clocks, watches, pagers, door bells, car alarms, palm pilots,and personal calendars. It should be understood that although usingclips for alerts is preferable, full unedited files may also be used.

According to one embodiment, real music sound clips are used to “ring” acellular or home phone. A clip or series of clips, which the user canselect, are played instead of the conventional electronic chime or ringtone. Such a system may be implemented on conventional cellular phones,which may be analogue or digital, by downloading firmwear comprisingalgorithms for delivery, storage, and playback of the sound files, tothe RAM element of the phone. Such algorithms include a delivery methodalgorithm, a parametric optimization and compression algorithm, and anerror correction algorithm. Alternatively, an accessory unit thatattaches to the cellular phone for implementing the system of thepresent invention may be provided.

According to another embodiment, sound and/or image clips are used forcomputer alerts such as e-mail notification sounds. Clips may also beused to ring a doorbell. Sound clips may further be used by a clock orwatch to sound at the beginning of each hour, similar to a grandfatherclock, wherein a different sound clip may be played at each hour.

The present invention allows the user to store hundreds of differentalert clips on a device. Unlike conventional electronic equipment whichhold a limited number of selectable alerts, such as a conventional caralarm or music player alarm clock, the present invention allows the userto choose from an unlimited number of clip files including allowing theuser to create his own alert clips or to choose from a library ofuniquely selected and/or edited files, including samples taken fromCD's, movies, television shows and the like.

A cellular phone, or similar device (having a processor, RAM, and flashelements) may be integrated with software at the time of manufacturingfor implementing the system of the present invention. Alternatively, achip may be embedded into the device or an accessory unit, including aspeaker, which attached to the device for implementing the system of thepresent invention may be provided. The accessory unit may have anadapter connection to the device. Such accessory unit may be sold withseveral adapter outlets to enable it to fit onto a variety of differentelectronic devices.

Sound and/or image clips may be pre-stored on the electronic device oraccessory unit at the time of manufacturing, such that the consumer maybe able to used the clips for alerts, without first having to select anddownload clips.

A user of an electronic device, according to the present invention, maydownload and store a number of clips off of a website via a plug-inconnection of the device to the computer, or via a wireless networksystem such as the Apples Airport. Additionally, a non-Internet basedholding system, which may be especially adapted for delivering clips tothe electronic device or accessory unit may also be provided. Suchholding system may be accessed via a phone dial in connection wherein auser may interact with the holding system by using the phone keypads orvoice commands. Other controls for interacting with the holding system,such as control buttons, voice commands or text keypads, may be providedon the accessory unit or the electronic device itself which may beespecially adapted for interacting with such holding system. Theaccessory unit or electronic device itself may also be soldpreprogrammed with embedded clips for demonstration use.

Additionally, such electronic devices may be capable of receiving orsending clips directly from one device to another device. To preventtransferring of entire files from one device to another, a securityfeature may be included on the devices and work in conjunction with thefile.

One method of preventing the transferring of files is to encode eachelectronic device or accessory unit with unique scrambling/unscramblingwave capabilities. As such, when a user transfers an entire me to hisdevice, say a cellular phone, for which he pays a fee, a scramblingwave, which may be a function of his unique telephone number, may beencrypted in the file. Upon playing the file, the user's cellular phonesends the corresponding unscrambling opposite wave. Other devicespurchased by the same user may also include the uniquescrambling/unscrambling wave encryption capabilities associated with theuser's telephone number. As such, the files may only be played withclarity on the device or devices owned by the user, even if such filesare transferred to other devices.

The security lock mechanism allows the original music or itsrepresentative to control distribution of music, and also provides anopportunity for music distributors to keep track of who plays theirmusic. As such, a method of accounting for royalty payments to artistsand performers and other parties registered with performing rightsorganizations such as ASCAP and BMI may include providing a trackingfeature on electronic devices used by businesses such as bars,restaurants, and clubs to play music. In addition to allowing a recordto be kept as to which music files have been downloaded and stored onthe electronic device, the tracking feature may also record informationon how many times and when each song has been played. This allowsperforming rights or music writers organizations the ability to keep anaccurate record on which to base royalty payment distributions.

An electronic device having stored sound or image clips may includevarious features which allow the user to preprogram the clips to play ina set sequence or a random order. (For example, certain clips which maybe from the same or different songs may play in a congruous orback-to-back order with a fixed silence time between the clips.)Additionally, the device may have features allowing the user to classifyand arrange the clips based on categories such as the type of clip(i.e., movie, song, etc.), artist name, time period, etc.

Thus, a user of an electronic device utilizing the clips according tothe present invention will be able to arrange the clips either through awebsite from which the clips can be downloaded onto the device, orthrough the device itself.

Additionally, after listening to or viewing a clip, the user can chooseto download the entire file from where the clip originated (i.e. theentire movie, song, etc.). The server providing the clips and the filesmay provide the clips for free or for a small fee as inducement for theconsumer to ultimately download the entire file for which a greater feemay be charged.

An electronic device according to the present invention may also havethe ability to receive clips which are directly transmitted onto theelectronic device via audio or visual broadcasts. The user of anelectronic device may program the device to sound a specified broadcastas an alert sound. For example, a sound segment from a live radio show(i.e., a sports show or a commercial) may be used to ring a cellularphone by either the caller or the callee.

Advertisements may also be transmitted through the electronic devicesaccording to the present invention. A message such as “pick up the phoneand don't forget to drink Coca Cola™” may be used to alert of anincoming call. Such transmitted advertising messages need notnecessarily function as alerts.

Additionally, this invention contemplates the use of image and soundclips which can be combined such that the user can create a unique clipof both sound and image for use in electronic devices having displayscreens. For example, a phone having an appropriate display screen canbe preprogrammed to display a visual clip of a caller accompanied bysound, or a computer alert may display an image clip with sound. Animage clip may comprise a single image frame or a picture clip as wellas an animation.

Website to be Used as a Support Tool for Downloading Clips to ElectronicDevices and Method for Selling

According to a preferred embodiment, a website for downloading soundand/or image clips holds a library of clips, each clip having a specificidentifying code or icon which may include, for example, the title of amusical composition or movie from where the clip originated, the name ofthe artist, a code number, or other type of identification depending onthe type of clip. For example, a song clip may be listed asbarrywhite@lovestuff.wav, or may display the picture of the song artistor CD cover of the CD on which the song appears, along with the name ofthe song. The list may be organized according to the artist's name, bymusic classification (i.e., pop, jazz, R&B, hip hop, etc.), by length ofthe sound clip, by the type of sound clip (i.e., song, piano music,guitar music, loud, quiet, etc.), or any combination of these categoriesor other conventional categories depending on the type of clip (image orsound). The website may also include categories of longer clips whichmay be more suitable for phone rings, and shorter clips which may bemore suitable for computer alerts. The website may further contain asuggested list of weekly or daily favorite clip picks, which may beprovided for each category or subcategory. Additionally, items orsubcategories in a given category may be organized alphabetically, byyear of copyright, or any other conventional order.

Tables 1 and 2 are examples of possible arrangements for sound clipsusing music classification and artist name. Note that the listings ofTable 1 such as barrywhite@lovestuff.wav are not websites, but usesymbols associated with web use, such symbols being one of the manyarbitrary ways of listing the clips. The symbol-driven website-likelistings may end with other non-domain suffixes such as “.pop”, “.song”,etc.

Additionally, this invention contemplates the use of website hyperlinksassociated with each listing as shown below in Tables 1 and 2.

TABLE 1 R&B Jazz Rock Barry White Herb Albert Chicagobarrywhite@lovestuff herbalpert@sunspots.pop chicago@fellings.wavbarrywhite@deepvox herbalpertgdatingame.pop chicago@time.wav

TABLE 2 R&B Jazz Rock Barry White Herb Albert Chicago BW-01 A-Sexy HA-01A-Date C-01 A-Begin BW-02 A-Love HA-02 A-Bull C-02 A-Search BW-03A-Peace HA-03 A-101 C-03 A-Color

The clips may also have an identifying number associated with each clip.Such identifying numbers may be used in downloading the clips to anelectronic device using a telephone (described below) or other devicehaving a number keypad. The website may further include a virtualpersonal locker or storage area for storing a selection of clipspersonal to a user which can be accessed on the website by a unique useridentification name or code. As such, a user can store clips for laterpurchasing, downloading to the user's cellular phone, playing, etc. Thewebsite may also allow the user to upload personal clips such as familyphotos, voice recordings, home movies, and the like, to the storagelocker for later downloading to the user's cellular phone or otherelectronic equipment. The storage locker may include an organizer forstoring the clips in alphabetical order, by various categories, or anyother order.

The website may allow for direct downloads of the clips from the websiteto the computer itself or to other electronic devices.

To illustrate how downloading through the website may be carried out, auser operating the computer may drag his/her mouse over the variouslisted sound or image clips and click on one or more selected clips.Thereafter, a box can appear prompting the user to select theappropriate electronic device onto which the clip or clips are to bedownloaded (e.g., the box may say “CELLULAR PHONE DOWNLOAD OR COMPUTERALERT DOWNLOAD?” Assuming that the “CELLULAR PHONE DOWNLOAD” button isselected, a prompt for typing in the appropriate cellular phone numberwill follow. Thereafter, the selected clip or clips may be uploaded tothe user's personal locker and made available for downloading to theuser's handset.

Other features may also be included, such as an option allowing the userto arrange multiple downloads in a specific order, create a folder forgrouping multiple downloads, or a feature incorporated into the phonewhich causes it to ring a selected clip immediately after it has beendownloaded. Additionally, clips, which have been previously downloadedto the phone may be deleted, rearranged, or reclassified with or withoutusing the website. (There are other methods for storing clips on anelectronic device such as a cellular phone, some of which are describedbelow, including direct downloading access for phones without the needfor web phone access capabilities).

Alternatively, by clicking “COMPUTER ALERT DOWNLOAD,” the selected clipwill be downloaded to the user's computer, allowing the user to selectmany different sound and image clips for computer alerts, such as e-mailnotification, computer alarm clock, and computer calendar notification.

Additionally, multiple clips may be associated with one type of alert,such that a different clip is played for each alert event. The user willbe able to rotate alert clips and preprogram or randomize their ordersimilar to a CD stereo carousel.

The user will also be able to hear or view a selected clip which willplay on the website upon the user's command. Browsing capabilitieswherein the user can drag his mouse over the sound clips library ofselections and hear the clips without having to click or open a file maybe included in the website. According to a preferred embodiment, a usersimply drags his mouse over various clip samples, which light up orflash and play one at a time. Any time the user places the cursor over acategory of music, the first tune in that category plays, and the iconrepresenting that category of music switches to display the name of theartist and title of the song or composition being played. Once the userclicks upon that icon, he can select the next song and hear the songwhile at the same time seeing the name of the artist and song title. Theuser can cycle through all the songs within that category using thisapproach very quickly to not only browse but to also hear the music. Ifthe user does not wish to switch over to another category of music, hesimply moves the cursor to another icon and repeats this procedure. Toselect a particular song the user double clicks on the song, which isthen included in a collection of selected songs to be downloaded later.

The website may be used as a shopping forum where consumers can hear orview the clips and click to buy items associated with the clips such asmusic records, cassette tapes and CD's, DVD's, and movie videos, ordownload the entire sound or image file to their computer for a fee. Byallowing the user to sample and download clips for use as alerts inelectronic devices, the website will provide an attractive forum forselling items associated with the sound and image clips, and forallowing the user to download the entire file associated with the clip,for which a fee may be charged.

Additionally, an identifying mini icon such as the song title orrecording artist CD icon associated with a clip or with a group of clipsmay appear on the computer screen at a fixed location and/or at thescreen display where the clip plays a computer alert. The icon mayinclude a “buy” button which will allow the user to purchase an itemassociated with the clip, or download the entire file from which theclip originated by clicking on the button. Such “buy” button may be ahyperlink to a website for transacting the purchase. Where a CD icon isnot used, the user may click on the song title to purchase an itemassociated with the clip. If the clip comes form a song that exists onmore than one CD, the customer will see more than one CD cover to choosewhich CD to buy.

The utility of clips as alerts for electronic equipment will provideconsumers with incentive to browse the website and sample the clips.After hearing or viewing the clip, consumers may be induced to purchaseitems associated with the clips, which they will be able to doinstantaneously through the website by the click of their mouse.

The website may further be used as a contest forum. The website may beset up to play mystery clips or short segments of sound recordings whichcontestants will have to identify in order to win a prize (i.e., bybeing the first to e-mail or call with the correct answer). Thus, aradio show may set up a game where a short segment of a sound clip isplayed on the radio or user's phone for contestants to guess and is alsoavailable for the listeners who want to hear it again on the website.

The website may include forwarding capabilities, such that a sound orimage clip can be forwarded as a greeting to a friend. (The securityfeature may be used only to prevent transferring of entire files).Consequently, the website will attract customers for the purpose ofdownloading clips to electronic devices and ultimately purchasing itemsassociated with the clips. Additionally, the user may subscribe to aservice such as an existing cell phone service provider for downloadingfiles through their telephone, without having to be online.

The user may also create a clip (e.g. by recording a song or personalclip) and store the clip onto a sound storage element in the electronicdevice. Additionally, the electronic device can be preprogrammed withclips selected by the manufacturer retailer of the device.

Accessing of Sound and/or Image Files without Access to Internet

Sound and/or image files which include clips may be downloaded withoutuse of the Internet by allowing a user to access a library of clips viatheir cellular phone or home telephone or providing other electronicdevices with features which allow automatic access to the library.(Although Internet free accessing will be described with respect to atelephone, it is to be understood that the method describe may becompatible with any electronic device preferably having accessingcapabilities similar to a telephone).

The library may be a non-web holding unit that has files with associatedcodes which match the codes associated with the files on the website,wherein the website serves as a usable guide for identifying variousfiles according to associated codes, such as numerical codes to assistthe user in downloading files using voice commands or keypad commands.

Additionally, cellular phone or home telephone users may access anon-web holding unit with a library of stored files which can similarlybe browsed, selected, and downloaded onto the phone using user voicecommands, key pad commands, or by connection to a live operator. Suchunit may be accessed by dialing a phone number (e.g., an 800 number).Home telephones and cellular phones may have separate holding units,such as a satellite for cellular phones and a ground unit for homephones, or a satellite can be used by cellular phones to access a groundholding unit.

To facilitate selection of the files from such holding unit, the accesssystem may provide for a code associated with each file which may beobtained by browsing the website as described above. As such, a userconnected to the holding unit would access the code associated with thefile to select and download the file to the user's telephone.

Many other methods allowing a user to select files from the holding unitare possible. For example, the telephone may include a voice recognitionfeature, wherein the user can say the name or part of the name of thesong he wishes to select (e.g., “Strawberry Fields” or the name of thesong artist). The phone may also utilize hierarchical submenus wherebythe user may press dial keys with letters corresponding to a selectionin a given category which ultimately leads to the selection of aparticular song. A phone having a screen display for providing a textlisting of the names of songs or categories, according to hierarchicalsubmenus, may also be used for enabling the user to narrow down to alist of songs and/or artists from which he can make his final selection.

A telephone may likewise be used to deliver files stored on the phone toa website, an e-mail address, another telephone, or other electronicdevice. Sound clips, which are segments of whole songs, musicalcompositions or other sound recordings, will be used mostly withtelephones, however, downloading entire music or image files may also bedone, subject to the security feature described above. Additionalrevenues may be generated as the consumer accesses the content libraryand uses airtime while browsing and downloading clips or entire songsfrom the library holding unit.

Furthermore, cellular phone and home telephone service providers mayoffer extra features to phone subscribers which would allow thesubscribers to download and store sound files for use with the telephonein accordance with the present invention. Such features by serviceproviders may include a personal sound file storage box (which may be afile of clips and/or entire sound files) that the user can access via apersonal code. The user may be charged a monthly fee for a subscriptionto the service, and/or per downloading of each song, whether or not theuser is a subscriber. Additional revenue can be generated by the serviceprovider even if the service is provided without a special charge sinceconsumers will use more airtime.

Telephone Using Sound Clips

A telephone having stored sound clips which may comprise real musicincluding human voice, various instrument sounds, and other soundeffects may allow the user to select one sound clip or a rotation ofseveral clips to “ring” the phone. Although it is preferable to ring thetelephone with sound clips, an entire music file may also be used,whereby for example, a song may start playing and continue until theuser picks up the telephone. (Of course, entire music files may beplayed on the telephone solely for the user's listening pleasure). Thetelephone may also be programmed to ring a conventional chime if theuser so chooses. Such a telephone may utilize a storage chip carryingstored sound clips as well as the conventional phone chime programmedonto it.

Additionally, the telephone may allow the user to determine how manytimes a clip is repeatedly played for each ring, and the time delaybetween clips in a given ring. The user may also choose to mix differentclips in one ring. A telephone may also include a looping feature whichrings the telephone in a looped clip such that the clip plays repeatedlywithout a pause between repetitions of the clip, or a “cluster” featurewhich rings a “cluster clip” comprising a multiple number of clipsegments from a single song, musical composition, or other soundrecording played in sequence.

Other features will allow the telephone user to preprogram the telephoneto play a certain clip when a specific individual calls, therebyallowing the user to identify the caller based on the chosen sound clip.Each person who regularly calls the user may have a unique identifyingring. This will allow a telephone user to have the option of assigning aunique sound caller ID to each of an unlimited number of callers. Otherfeatures may include allowing a caller to select his own personal soundclip to “ring” the telephone of the recipient of the call. (For example,the caller may sing or record a “Happy Birthday” song.) Also, atelephone used by more than one user may utilize sound clips for acallee ID function wherein the caller identifies the intended callee(e.g., by dialing a digit or sequence of digits) and the telephone playsthe clip associated with the callee.

Additionally, a telephone may be provided with a “caller messagerecorder feature” which allows the caller to record his/her own messageto send to the number dialed. For example, the caller may send a messagesuch as “Hey John. It's Mary. Pick up the phone,” by pressing a “recordring” button on his/her phone to send such a message to John's phone. Asdescribed earlier, the telephone user, say John, may have a caller IDfeature such that when a certain caller, say Mary calls, the telephonerings with a predetermined message or sound clip selected by John. Theadditional caller message recorder feature may cause John's telephone toplay Mary's message instead of overlaying the predetermined caller IDmessage or clip. Additionally, John may record his own message such as“It's Mary” and associate that recorded message with Mary's phone numberfor a caller ID ring.

A telephone, according to the present invention, may also include a“sensory feature” for enabling the telephone to sense the level ofambient noise and adjust the loudness of the “ring” accordingly. If thephone “senses” very loud background noise, for example, a cellular phonelocated in a loud restaurant, the ring volume will increase. A featurefor detecting whether a cellular phone is located in a pocket book or aplace where the “ring” sound may be muffled is also contemplated. Thisfeature will also enable the phone to adjust the volume of the “ring”such that the “ring” will be loud enough for the user to hear. Suchdetection mechanism can be achieved by detecting ambient light and otherconditions.

Additionally, the user may manually adjust the volume of the ring via atunable volume control mechanism or a multiple fixed settings control.(Although the above features are described with respect to a phone, itis to be understood that these features may also be provided with otherelectronic devices utilizing sound and/or image clips as alerts whereapplicable).

Transmission System for Delivery Clips to a Telephone

FIG. 1 is a schematic diagram illustrating the basic components for awireless transmission system 100 for a telephone 102, having a wirelessor landline service provider.

The system is described in terms of two main components: a storage chip104, and a server 106. The storage chip 104 is an element associatedwith the telephone which may be embedded into the phone or into anaccessory unit which attaches to the phone, having abilities tointerface with the phone. The existing hardware of a cellular phone mayalso be integrated with a software system which may be downloaded to theRAM element of the cell phone for incorporating the present invention,without the need for extra hardware. As such, the existing hardware ofthe cellular phone may be made to perform the same function of the chip.

The purpose of the chip 104 is to store a selection of clips, allow fordownloading of clips to be stored on the chip 104, and allow for theplayback of clips, either by the telephone or the chip 104. (Althoughthe description herein refers to sound clips, it is to be noted thatentire sound files may be stored, downloaded, and played, according tothe system described). Additionally, the chip 104 can associate thestored clips with a caller ID so that the particular clip to be playedback is determined by the calling subscriber ID.

The server 106, which is associated with a collection of stored clipfiles 108, is designed to execute requests of the chip 104, which may begiven through user voice commands or commands using the phone keys. Theserver may be equipped with a voice adapter no for supporting the ITU-TY.253 standard and telephone lines attached to the voice adapter. Thevoice adapter can also support some standard modem protocols, like Y.32or Y.34, if required for compatibility.

The server 106 also allows for files to be transmitted to the chip 104for storage. The system 100 enables a connection to the server 106 upona request from the chip 104, utilizing the phone, and/or PSIN (PublicSwitched Telephone Network), and/or a voice card (voice modem) attachedto the server computer.

The system may have a voice menu, which, after connection to the server106, allows the user to listen to the server's menu and navigate throughthe system of voice menus using the telephone's Dual Tone MultiFrequence (DTMF) keys. The system may allow the user to select anddownload clips by allowing the user to listen to the clips presented bythe server 106, select a clip, and issue a download command to theserver 106. The server then sends the selected clip (e.g. in digitalcompressed form) using a Custom Data Transmission Protocol (CDTP) overthe voice channel. (Illustrated in FIGS. 2 and 12).

The system 100 allows for storage of a large number of clip files in thechip's memory. The system's server 106 utilizes a music compressionalgorithm, shown in FIG. 15, which converts common music files intocompressed files that are downloaded and stored by the chip 106. Forexample, a chip supporting the storage of about 1000 clips, each beingapproximately a few minutes in length, may have a flash memory size ofabout 40 Mbytes. The chip 104 may also have a sound output element suchas speakers.

The server comprises software which can run under Windows 98, Windows-NTOS, or other suitable system using a voice modem for communications.Additionally, the system may use a single modem or a pool of severalmodems.

Preferable Embodiments for a Telephone System

Examples of telephone systems utilizing the method of the presentinvention include a cellular phone which may utilize an analogue(voice-only) system or a digital system, and a conventional land linetelephony network. A system for using a cellular network infrastructureis shown in FIG. 2. A schematic diagram of a landline transmissionsystem for a home telephone is shown in FIG. 12. (Again, although thefollowing descriptions make reference to the use of sound clips, it isto be understood that entire sound files may also be used as described).

All described examples assume existence of a server preferably dedicatedfor servicing user requests and providing sound clip data downloadcapabilities. A corresponding chip, implementing all required functionsis associated with the telephone.

The server may be a computer running Microsoft Windows or other suitableenvironment, such as a Pentium PC, Win95/98/NT/2oo0, 128 Mb RAM, 4 GBHDD. The server may store or be capable of accessing a sound clipdatabase, which may be stored on a website or non-web holding unit. Thesound clip database is stored in a compressed file format of thosecommonly known.

A schematic diagram for a server software system is shown in FIG. 4 fora cellular phone system, and FIG. 14 for a landline system. The softwaremay be written in C++ under Microsoft Windows or other suitablelanguage. The functions of the server software include servicing userrequests via a user interface element and transmitting a selected soundclip through the phone line via a music clip transmission element.

According to a preferred embodiment, the functions of the user interfaceelement include decoding DTMF keys pressed by the user and playing thevoice menu labels to the user. The voice menu interface may includehierarchical submenus, leading to different functions. In all examples,the user interface element can be unified in the sense that the voiceinterface and DTMF or voice recognition-based interface are independentof the type of network or type of phone(s) used. Implementations thatutilize a single server to process requests originating from differenttypes of networks and/or phones can therefore be built.

The system of submenus leads a user to the downloading of the selectedsound clip. Thereafter, control is transferred to the music cliptransmission element for downloading sound clips into the phone. Themusic clip transmission element interfaces directly with the phoneaccessory unit, independent of the user. The music clip transmissionelement is dependent on the type of the phone used and the networkinfrastructure.

Example Transmission System for a Cellular Phone and Network

FIG. 2 is a schematic diagram of a wireless transmission system 200 fora cellular phone 202, which may be either an analogue (voice-only) ordigital system. In both cases, a specialized board 203, implementing allrequired functions, similar to the chip 104, is incorporated in anaccessory unit 204 attached to the cellular phone. Although the system200 is described as incorporating an accessory unit, it should beunderstood that a chip performing the same functions of the board mayinstead be embedded in the phone itself, or a software system may beintegrated with the existing hardware chip of a conventional cellularphone without the need for additional hardware. The system 200 furtherincludes a server 206 and software 207 for the server.

The cellular telephone 202 may be any commercially available cellularphone having capabilities for supporting a command set for generaltelephone control, [i.e., a Y.25 Ter serial asynchronous automaticdialing and control as recommended by the ITU-T (InternationalTelecommunication Union-Telecommunication sector)] and for supportingY.25 Ter “+C” extensions according to the ETSI (EuropeanTelecommunications Standards Institute) ETS300-916 standard forobtaining codes of keys pressed by the user and for receiving caller IDinformation. Additionally, the phone 202 should have capabilities forsubscribing to a cellular provider 208 with caller ID service support.

A schematic diagram of the board 203 is shown in FIG. 3. In anembodiment where an accessory unit is used the board 203 is implementedin the accessory unit 204 which can be attached to the phone 202 througha standard extension connector where other commercially availableaccessories such as a hands free ear set and charging adapter aretypically connected.

The board 203 includes the following main blocks: a Digital SignalProcessor (DSP) 300, a flash memory element 302, a Random Access Memory(RAM) element 304, an initial bootstrap chip 306, an analogue interfaceelement 308, and a digital interface element 310.

The processor 300 executes the device firmware, provides control for allother blocks and performs the computational tasks for the board 203. Thetasks performed by the processor 300 include control of the board'sunits, monitoring of keys pressed by the user and processing ofkey-press events, reception of information from the computer through thecomputer digital interface, reception of caller ID information throughthe phone digital interface, reception of packed sound clips through thephone analogue or digital interface, unpacking and then playing backsound clips through a built-in speaker connected to the analogueinterface of the accessory unit 204, support of a voice menu-driven userinterface, and performance of other auxiliary functions.

The flash memory element 302 contains the device firmware, and soundclips which can be pre-loaded as well as downloaded from the server. TheRAM element 304 enables the processor to run faster and also holdsbuffers for unpacked sound fragments and processor service procedures.When the power is turned on, the initial bootstrap chip 306 loads thedevice.

The analogue interface element 308 includes a phone interface element312 and a built-in speaker interface element 314. The phone interfaceelement 312 is used for input and output of signals when downloadingsound clips from the server 206. The speaker interface element 314, withthe speaker, plays all system sounds heard by the user including voicemenus and sound clips.

The digital interface element 310 includes a phone interface element 316and may include a computer interface element 318. The phone interfaceelement 316 is used for phone control and for receiving key codes andcaller ID information from the phone. The computer interface element 318is used for various service functions such as downloading preprogrammedsound clips from the computer to the flash memory.

The functions of the server software, shown in FIG. 4, include servicinga user's requests via a user interface element 402 and transmitting aselected sound clip through the phone line 404 via a music cliptransmission element 406.

A user interface element 402 is provided whose functions includedecoding DIMF keys pressed by the user and playing the voice menu labelsto the user. The voice menu interface may include hierarchical submenuswhich lead to the downloading of the selected sound clip. Thereafter,control is transferred to the music clip transmission element 406 fordownloading sound clips into the phone. The music clip transmissionelement 406 interfaces directly with the phone accessory unit,independent of the user.

The selected sound clip may be transmitted through the phone line to theaccessory unit 204 first through the server hard drive 408, then throughthe server software 207, next through the voice adapter 210, thenthrough the phone line of the network to the cellular service provider208, to the cellular phone 202, and through the analogue interface 308of the accessory unit 204, then through the processor 300 of theaccessory unit 204, and finally, through the flash memory element 302 ofthe accessory unit 204. When the sound clip transmission is completed,the task of the music clip transmission element is completed.Thereafter, the phone line 404 is released and control is transferred tothe user interface element 402.

In an autonomous mode, the board 203 may contain a number of pre-loadedsound clips. Initially, the board 203 is in the inactive state. Theboard 203 and phone 202 interact such that the phone sends to the boardcodes of all the keys pressed by the user. Upon receiving a particularsequence of codes or when, for instance, a particular key is pressed fora prolonged period of time, the board 203 switches to the active mode.In the active mode the board 203 may interact with the user via a voicemenu-driven interface where voice messages, via a speaker, prompt theuser to respond by pressing a selection of phone keys indicating theuser's responsive selections. The board 203 reacts to the user'sselections by analyzing the keys being pressed.

The clips are stored on an internal clip index which can be retrievedfrom the internal memory and played back according to key commandsprovided by the user. Examples of voice menu options provided by thedevice 204 through a speaker upon switching to an active modeinclude: 1) the user may choose to exit the active mode and enter thepassive mode (e.g., by pressing “0”); 2) the user may choose to listento the current sound clip on the clip index (e.g., by pressing “I”); 3)the user may choose to listen to the next clip on the index (e.g., bypressing “2”); 4) the user may choose to listen to a previous clip onthe index (e.g., by pressing “3”); or 5) the user may choose to assign acaller ID number to the current clip on the index (e.g., by pressing“4”).

Upon choosing to assign a caller ID phone number to the current clip,the device may switch to a sub mode with a sub-menu having the followingoptions: 1) the user may choose to switch back to the previous menu(e.g., by pressing “0”); 2) if the selected clip was already assigned,the user may choose to get information regarding the caller ID alreadyassociated with the clip (e.g., by pressing “I”); or 3) the user mayinput a new caller ID phone number for the current clip (e.g., bypressing “2” indicating this choice, then dialing in the phone numberfollowed by the “#” sign).

In the passive mode, the device 204 may analyze messages being receivedfrom the telephone 202. Upon receiving an incoming call, the device 204checks the incoming caller's phone number against the list of assignedcaller ID sound clips in its memory and plays back the particular soundclip through the built-in speaker if the specified caller's phone numberwas assigned to this clip. Alternatively, the device 204 may play back adefault sound if the particular caller ID was not assigned to any clip.

In order to transfer digitally compressed sound clip data through theanalogue channel a special method and algorithm to map digits to soundsis used. This method is implemented not only for a cellular telephoneusing an analogue cellular network but also for a landline transmissionsystem of a home phone, shown in FIG. 12.

Method for Data Transmission Over an Audio Channel of a WirelessTelephone

A data transmission method 500 for transferring data through the phoneline and the receiver, based on a voice mode connection (versus datamode) and DIMF signal interpretation is illustrated in FIG. 5. A similarapproach can be implemented for a landline telephone that does not havea data transmission mode.

For transmitting data through the phone line, the transmission method500 comprises the steps of a) data scrambling 502, b) data mapping 504,c) conversion of frequency symbols to time samples 506, d) addition ofcyclic prefix 508, and e) digital to analogue conversion 510. The datais then sent through the receiver, following the reverse steps of f)analogue to digital conversion 512, g) symbol synchronization 514, h)conversion of time samples to frequency symbols 516, i) decodingfrequency symbols to bits 518, and j) descrambling the data 520.

The transmission method is used to provide enough speed for the datatransmission. The transmission method allows simultaneous use of thevoice communication and data transmission features (during oneconnection session) without having to switch the mode of connection. Acustomer does not need to use a Wireless Internet Service Provider. Auser can simply place a regular call to the specific number (e.g., an“800” number) to gain access to the Server. The dual-mode connectionallows for voice and “push button” support as well as voice recognitionservice.

An orthogonal frequency-division multiplex (OFDM) modulation scheme isused for data transmission. The benefits of OFDM include: 1) themodulation can be made robust to Inter-Symbol Interference (IS!) byincreasing symbol size; 2) the modulation can be made robust to impulsenoise by increasing symbol size; 3) for each individual sub-channel, thechannel's response could be considered essentially flat, minimizing theneed for channel equalization; and 4) different encoding schemes couldbe used for different sub-channels, for accommodatingfrequency-selective channel distortions.

The total bandwidth to be used by the method is determined by the worstcase of supported audio channel. A suitable algorithm for compression ofthe voice channel is the GSM RPE-LTP algorithm which essentially has abuilt-in down-sampling by a factor of 3 in which the allowed bandwidthis limited at 4000/3=1333 Hz. Usually channel response is severelylimited at frequencies below about 200 Hz to 250 Hz. No OFDM symboltime-windowing is employed to minimize variations of transmitted signalamplitude envelope.

A compressed voice channel can also introduce significant non-lineardistortions. Therefore, it is not feasible to have a large number ofsub-channels; otherwise the algorithm would be affected by significantinter-channel interference (ICI) due to loss of orthogonality betweensub-channels. About 32 sub-channels appear to provide enough symbol sizewhile maintaining satisfactory low ICI.

Modulation Symbol Structure

Each OFDM symbol consists of a minimum number of samples sufficient torepresent all sub-channels. To increase computation efficiency, a FastFourier Transform is employed to convert sub-channel symbols fromfrequency to time area. Therefore, for 32 sub-channels, OFDM symbol sizeshould be at least 64 real samples (at 2666 Hz rate). A circular prefixof 16 samples is used to improve separation between symbols, andminimize ISI (Inter Symbol Interference) and ICI. Therefore, totalsymbol size is 80 samples at 2666 Hz.

Receiver Synchronization

Circular extension prefix redundancy, present in the signal, is used tofacilitate OFDM symbol synchronization in the receiver. Asynchronization subsystem effectively computes auto-correlationcoefficients of the received sequence (e.g., at 2666 Hz). The output ofthe correlator goes through a “rectifying” phase-locked loop-like systemwhich outputs synchronization impulses at the proper time instants tosample OFDM symbols correctly.

Synchronization system induced timing jitter leads to rotation ofreceived sub-channel phasors by increments, proportional to the centralfrequency of a particular sub-channel. This rotation is compensated inthe decision scheme.

Data Mapping

The output of a scrambler is mapped onto complex symbols(amplitude/phase) of the OFDM sub-channels. Individual sub-channels useQPSK (Quadrature Phase Shift Keying) modulation.

Data Scrambling

Data scrambling is employed in order to provide statistically randomdistribution of transmitted symbols to reduce peak-to-average powerratio of OFDM symbols. A self-synchronizing scrambler with generatingpolynomial of is 1+X1B+X23 is used which, at the transmitter,effectively divides the data sequence by the generating polynomial. Thecoefficients of the quotients, taken in descending order, form theoutput data sequence.

Example Using Analogue Cellular Network and Cellular Telephone

The above-described accessory unit 204 is provided in this example inthe context of analogue (providing only voice channel) cellular network.

Initially, the device 204 is in an inactive mode. A user dials theserver number and, navigating through a system of voice menus supportedby the server software, listens to and selects a particular sound clipin the same way as browsing the loaded sound clips in the autonomousmode. Instead of assigning a caller ID, the user may choose to downloadsound clips.

When a user, navigating through the server voice menus, selects todownload the current clip and in the embodiment using an accessory unitactivates the accessory unit 204 through the predefined key sequence,the process of transmission of the selected sound clip is initiated.After selecting a “download” option, the user may press a specific keycombination on the phone to switch the accessory unit 204 from theinactive to the active mode. The unit 204 then begins to interact withthe server 206, using the analogue channel provided by phone andnetwork. The already established phone connection is used to receiveinformation. The device may receive the sound clip selected by the userand download it into internal flash memory.

At the end of a session, the unit 204 forces the telephone 202 to hangup and switches to the autonomous mode which enables the user to assigna new caller ID to the sound clip just received. When a user, navigatingthrough the server voice menus, selects to download a clip and activatesthe accessory unit 204 through the pre-defined key sequence, the processfor transmission of the selected sound clip is initiated.

The selected sound clip is transmitted through the phone line to theaccessory unit 204 first through the server hard drive 408, then throughthe server software 207, next through the voice adapter 210, thenthrough the phone line of the network to the cellular service provider208, to the cellular phone 202, and through the analogue interface 308of the accessory unit 204, then through the processor 300 of theaccessory unit 204, and finally, through the flash memory element 302 ofthe accessory unit 204.

Generally, the server software 207 retrieves the selected sound clipfrom a database 212, converts it to the special sequence of soundsmodulates, transfers codes of these sounds to the voice adapter 210 thatconverts these codes to actual sounds and transfers these sounds to thephone line 214. From the phone line 214, the sounds go to a cellularprovider 208 through to a radio channel, and to the cellular phone 202itself (much like voice sounds are transferred during a normal phoneconversation). The sounds then go through the connector and are receivedin analogue form by the board 203. The sounds are then converted by thedevice ADC (Analog to Digital Converter) to the digital form and areprocessed by the DSP (digital signal processor-“demodulated”) 300 to thesame digital data form initially stored on the database 212 (e.g., inMPEG audio format). In this form, the sound clip data are written intothe flash memory 302 of the device 204.

Following a reverse direction, going from the board 203 to the server206 using the same chain, the device sends to the server either an“acknowledgement” of a successful delivery of the sound clip data or alist of data blocks received with errors so that these blocks can beresent in a second try. In order to transmit digital data through theanalogue channel, a similar procedure is used to convert data to soundsand back.

When all the data is transferred without errors, the board 203 signalsto the server 206 that the call may be disconnected. Thereafter, theserver 206 instructs the voice adapter 210 to hang up, freeing the phoneline for another client, and the board 203 switches to the autonomousmode, allowing the user to assign a caller ID to the sound clip mostrecently downloaded.

Example Using Digital Cellular Network and Cellular Telephone

The above-described accessory unit 204 is used in this example in thecontext of digital (capable of providing a dedicated data transmissionchannel) cellular network. Since in this case a digital channel is usedfor sound clip data transmission, no modulation is required on themobile phone side. The server, on the other hand, uses a modulationprotocol compatible with the protocol supported by the cellular networkprovider. Usually this can be accomplished by using a standardITU-T-approved modem, like Y.32 or Y.34.

Initially, the unit 204 is in an inactive mode. A user dials the servernumber and, navigating through a system of voice menus supported by theserver software, listens to and selects a particular sound clip in thesame way as browsing the loaded sound clips in the autonomous mode.Instead of assigning a caller ID, the user may choose to download soundclips.

When a user, navigating through the server voice menus, selects todownload the current clip and activates the accessory unit 204 throughthe pre-defined key sequence, the process of transmission of theselected sound clip is initiated. After selecting a “download” option,the user may press a specific key combination on the phone to switch theaccessory unit 204 from the inactive to the active mode. The unit 204then begins to interact with the server 206, using the digital channelprovided by the phone and the network. If possible, the alreadyestablished phone connection is used, or a new connection is establishedspecifically for digital data transmission.

The selected sound clip is transmitted through the phone line to theaccessory unit 204 first through the server hard drive 408, then throughthe server software 207, next through the voice adapter-modem 210, thenthrough the phone line of the network to the cellular service provider208, to the cellular phone 202, and through the digital interface of theaccessory unit 204, then through the processor 300 of the accessory unit204, and finally, through the flash memory element 302 of the accessoryunit 204.

Generally, the server software 207 retrieves the selected sound clipfrom a database 212, transfers codes of these sounds to the voiceadapter-modem that converts these codes to actual sounds, using one ofthe standard modulation protocols supported by the cellular provider(like ITU-T Y.32 or Y.34) and transfers these sounds to the phone line214. From the phone line 214, the sounds go to a cellular provider 208,where they are demodulated back into digital sends to the server eitheran “acknowledgement” of a successful delivery of the sound clip data ora list of data blocks received with errors so that these blocks can beresent in a second try. In order to transmit digital data through theanalogue channel, a similar procedure is used to convert data to soundsand back.

When all the data is transferred without errors, the board 203 signalsto the server 206 that the call may be disconnected. Thereafter, theserver 206 instructs the voice adapter 210 to hang up, freeing the phoneline for another client, and the board 203 switches to the autonomousmode, allowing the user to assign a caller ID to the sound clip mostrecently downloaded.

Example Using Digital Cellular Network and Cellular Telephone

The above-described accessory unit 204 is used in this example in thecontext of digital (capable of providing a dedicated data transmissionchannel) cellular network. Since in this case a digital channel is usedfor sound clip data transmission, no modulation is required on themobile phone side. The server, on the other hand, uses a modulationprotocol compatible with the protocol supported by the cellular networkprovider. Usually this can be accomplished by using a standardITU-T-approved modem, like Y.32 or Y.34.

Initially, the unit 204 is in an inactive mode. A user dials the servernumber and, navigating through a system of voice menus supported by theserver software, listens to and selects a particular sound clip in thesame way as browsing the loaded sound clips in the autonomous mode.Instead of assigning a caller ID, the user may choose to download soundclips.

When a user, navigating through the server voice menus, selects todownload the current clip and activates the accessory unit 204 throughthe pre-defined key sequence, the process of transmission of theselected sound clip is initiated. After selecting a “download” option,the user may press a specific key combination on the phone to switch theaccessory unit 204 from the inactive to the active mode. The unit 204then begins to interact with the server 206, using the digital channelprovided by the phone and the network. If possible, the alreadyestablished phone connection is used, or a new connection is establishedspecifically for digital data transmission.

The selected sound clip is transmitted through the phone line to theaccessory unit 204 first through the server hard drive 408, then throughthe server software 207, next through the voice adapter-modem 210, thenthrough the phone line of the network to the cellular service provider208, to the cellular phone 202, and through the digital interface of theaccessory unit 204, then through the processor 300 of the accessory unit204, and finally, through the flash memory element 302 of the accessoryunit 204.

Generally, the server software 207 retrieves the selected sound clipfrom a database 212, transfers codes of these sounds to the voiceadapter-modem that converts these codes to actual sounds, using one ofthe standard modulation protocols supported by the cellular provider(like ITU-T Y.32 or Y.34) and transfers these sounds to the phone line214. From the phone line 214, the sounds go to a cellular provider 208,where they are demodulated back into digital showing three data blocks1600, 1602, and 1604 for corresponding data, indicated in the figure as“Data 1” “Data2” and “Data3”, with corresponding headers “Hdr1”, “Hdr2”,and “Hdr3”. Positive acknowledgements “Ack1” and Ack2″ are sent forpackets numbers 1 and 2. Packet number 3 is originally received with anerror, indicated by “Nack” and is subsequently re-sent to successfullycorrect the error, whereby a positive acknowledgement “Ack3” is sent.

If the phone or network supports only unidirectional data transmission,the other protocol can be utilized to minimize the number of channeldirection alterations. In this case, all data blocks for the sound clipare sent at once by the server, without receiving acknowledgements forthe individual packets. Then, a single control packet is transferred inthe opposite (device to server) direction. This control packet containsa bit mask, with one bit for each data block received. Each bit in thebit mask has a “1” value if the corresponding data block was receivedwithout errors, or a “0” value if the corresponding block was affectedby errors. The server then re-sends those blocks that were received witherrors in the first pass. When all data blocks are transferred, and theacknowledgement mask without errors indication is received by theserver, the sound clip is considered to be completely delivered. Exampleof such interaction is shown in FIG. 17, where the packet number 2 isoriginally received with an error and is successfully re-sentsubsequently to correct the error. Note that only two “ACK” packets weresent during the entire procedure.

Example Using Landline Telephony Network and a Conventional HomeTelephone

FIG. 12 illustrates an example of a landline telephony network system1200 using a voice channel and a conventional home telephone U01,according to a preferred embodiment. (Although this system is describedwith respect to the accessing and delivery of sound clip files, it is tobe understood that the system may be used for the accessing and deliveryof entire files).

The system 1200 has two main components: a home telephone accessory unit1202 and a server 1205. The accessory unit 1202 is an autonomous unit,attached to the phone line 1208 and to the phone (between the line andthe phone), and powered from the AC power outlet. A chip performing thesame function of the accessory unit may instead be embedded in the phoneitself. The purpose of the accessory unit 1202 is to support selection,downloading, and playback of sound clips according to the Caller ID ofthe calling subscriber. The accessory unit 1202 may include a speakersystem and enables the phone to ring sound clips or perform otherfunctions as described for the cellular phone 102.

The server 1205, which is associated with stored clip files 1206, whichmay be stored on a website or a non web holding unit, is designed toexecute requests of the accessory unit 1202 either through user voicecommands or commands using the phone keys, and allows for files to betransmitted to the accessory unit 1202 for storage. The system 1200enables a connection to the server upon a request from the accessoryunit 1202, utilizing the phone and PSTN (Public Switched TelephoneNetwork), and an adapter 1204 (voice modem) attached to the servercomputer. The system 1200 may have a voice menu, which, after connectionto the server, allows the user to listen to the server's menu andnavigate through the system of voice menus using the phone's DTMF keys.The system may allow the user to select and download clips by allowingthe user to listen to the clips presented by the server, select a clip,and issue a download command to the server. The server then sends theselected clip (e.g., in digital compressed form) using the Custom DataTransmission Protocol (CDTP) over the voice channel.

The device 1204 may further interface with a home PC 1214 fordownloading sound files to the device. The interface may be a plug inconnection or may use a wireless network system.

The accessory unit 1204 may be sold as a unit compatible to most homephones including cordless phones, and may connect directly to the phonejack, with the phone connected to the device. Similar to the cellularphone 102, a home phone may include an embedded chip, instead of theaccessory unit 1204, for performing functions similar to those of theaccessory unit U04′

The handset of a cordless phone utilizing sound clips according to thepresent invention may ring simultaneously with the box, wherein thehandset may sound a regular phone ring or a sound clip ring, while thebox plays a sound clip ring.

The server comprises software shown in FIG. 14, which can run underWindows 98®, Windows-NT OS®, or other suitable system using a voicemodem for communications. Additionally, the system may use a singlemodem or a pool of several modems.

Initially, the accessory unit U02 is in an inactive mode. A user dialsthe server number and, navigating through a system of voice menussupported by the server software, listens to and selects a particularsound clip in the same way as browsing the loaded sound clips in theautonomous mode. Instead of assigning a caller ID, the user may chooseto download sound clips.

When a user, navigating through the server voice menus, selects todownload the current clip and activates the accessory unit 1202 throughthe pre-defined key sequence, the process of transmission of theselected sound clip is initiated. After selecting a “download” option,the user may press a specific key combination on the phone to switch theaccessory unit 1202 from the inactive to the active mode. The device1202 then begins to interact with the server 1205, using the analoguechannel provided by the telephone and the network. The alreadyestablished phone connection is used to receive information.

The selected sound clip is transmitted through the telephone line to theaccessory unit 1202 first through server 1205, next through the adapter1204, then through the telephone line of the PSTN to and through theanalogue interface 1305 of the accessory unit (shown in FIG. 13), thenthrough the processor 1301 of the accessory unit 1202, and finally,through the flash memory element 1302 of the accessory unit 1202.

A schematic diagram of a board 1300 implemented in the accessory unit1202 is shown in FIG. 13. The board includes the following main blocks:a processor element 1301 [e.g., a Digital Signal Processor (DSP)], aflash memory element 1302, a Random Access Memory (RAM) element 1303, abootstrap chip 1304, an analogue interface element 1305, and a digitalinterface element 1306.

The processor 1301 executes the device firmware, provides control forall other blocks and performs the computational tasks for the board. Thetasks performed by the processor 1301 include: control of the board'sunits, monitoring of keys pressed by the user and processing ofkey-press events, reception of information from the computer through thecomputer digital interface, reception of caller ID information fromtelephony service provider, reception of sound clips through the phoneanalogue interface, unpacking and then playing back sound clips througha built-in speaker connected to the analogue interface of the accessoryunit 1202, support of a voice menu-driven user interface, andperformance of other auxiliary functions.

The flash memory element 1302 contains the device firmware, and thesound clips which can be preloaded as well as downloaded from theserver. The RAM element 1303 enables the processor to run faster andalso holds buffers for unpacked sound fragments and processor serviceprocedures. When the power is turned on, the bootstrap chip 1304 loadsthe device.

The Analogue Interface element 1305 includes a telephone interfaceelement and a built-in speaker interface element. The telephoneinterface element is used for input and output of signals whendownloading sound clips from the server 1205. The speaker interfaceelement with the speakers, plays all system sounds heard by the userincluding voice menus and sound clips.

The digital interface element” 1306 may include a computer interfaceelement and other digital interface elements to the home network. Thecomputer interface element may be used for various service functionssuch as downloading preprogrammed sound clips from the computer to theFlash Memory.

In an autonomous mode, the accessory unit 1202 contains a number ofpre-loaded sound clips. Initially, the accessory unit is in the inactivestate. The accessory unit 1202 and telephone 1201 interact such that thetelephone 1201 sends to the accessory unit 1202 codes of all the keyspressed by the user. Upon receiving a particular sequence of codes orwhen, for instance, a particular key is pressed for a prolonged periodof time, the accessory unit 1202 switches to the active mode. In theactive mode, the accessory unit 1202 may interact with the user via avoice menu-driven interface where voice messages, via a speaker, promptthe user to respond by pressing a selection of phone keys indicating theuser's responsive selections. The accessory unit 1202 reacts to theuser's selections by analyzing the keys being pressed.

In the passive mode, the accessory unit 1202 may analyze messages beingreceived from the telephone 1201. Upon receiving an incoming call, theaccessory unit 1202 checks the incoming caller's phone number againstthe list of assigned caller ID sound clips in its memory and plays backthe sound clip through the built-in speakers if the specified caller'sphone number was assigned to this clip. Alternatively, the accessoryunit 1202 may play back a default sound if the particular Caller ID wasnot assigned to any clip.

A schematic diagram of the server software is shown in FIG. 14. Theserver software is used for servicing user requests through userinterface element 1401, and transmitting the selected sound clip throughthe phone line via music clip transmission element 1402. The userinterface element 1401 decodes DTMF keys pressed by the user, and playsvoice menu labels to the user. The voice menu interface includeshierarchical submenus to lead the user to the downloading of the desiredsound clip, where control is transferred to the music clip transmissionelement 1401.

The music clip transmission element 1401 downloads sound clips to thephone, independent of the user interface element, interfacing directlywith the phone accessory unit. The music clip transmission element 1402initially transmits the selected sound clip to the adapter 1404 for datatransmission from the server to the accessory unit. When the sound cliptransmission is completed, the task of the music clip transmissionelement is done, and the telephone line is released and control istransferred to the user interface element 1401

Generally, the server software retrieves the selected sound clip from aserver database 1403, which is associated with an audio dataoptimization and compression element 1405, converts the clip to thespecial sequence of sounds modulates, and transfers codes of thesesounds to the adapter 1404 which converts these codes to actual soundsand transfers these sounds to the phone line 1406. From the phone line1406, the sounds go through the PSTN and are received in analogue formby the accessory unit 1202. The sounds are then converted by the deviceADC (Analog to Digital Converter) to the digital form and are processedby the DSP (digital signal processor) 1301 to the same digital data forminitially stored in the server database 1403 (e.g., in MPEG audioformat). In this form, the sound clip data are written into the flashmemory 1302 of the accessory unit 1202.

Following a reverse direction, going from the accessory unit 1202 to theserver 1205 using the same chain, the device sends to the server eitheran “acknowledgement” of a successful delivery of the sound clip data ora list of data blocks received with errors so that these blocks can beresent in a second try. In order to transmit digital data through theanalogue channel, a similar procedure is used to convert data to soundsand back. When all the data is transferred without errors, the accessoryunit 1202 signals to the server 1205 that the call may be disconnected.Thereafter, the server 1205 instructs the adapter 1204 to hang up,freeing the phone line for another client, and the accessory unit 1202switches to the autonomous mode, allowing the user to assign a Caller IDto the sound clip most recently downloaded.

The server audio data optimization and compression element 1205,utilizes a music compression algorithm outlined in FIG. 15, whichconverts common music files into compressed files in order to reduce theaudio clip size for minimizing its download time, while maintainingpredetermined audio quality. These files are downloaded and stored bythe accessory unit 1202.

Preferred Procedure for Audio Data Parametric Optimization andCompression

The method 1500 of compressing the files comprises the steps of a)conversion 1502; b) amplitude normalization 1504; c) sample rateconversion 1506; d) pre-emphasis filtering 1508; e) amplitudenormalization 1510;

and t) performance of MPEG audio layer 3 (MP3) compression with theselected parameters 1512. The compressed files are then transferred tothe server database.

Step 1502 of conversion to mono only needs to be performed if the inputfile is in stereo and if the audio output subsystem of the targethardware is only capable of playing back mono audio. At this step allavailable information is included into the output audio by summing ofthe left and right channels to form a single mono output.

After conversion, or if the file does not need to be converted to mono.compression begins with the step 1504 of amplitude normalization,wherein sample amplitudes in the file are normalized. This step isrequired for enabling audio utilization of all available dynamic rangeand for improving the computational accuracy of subsequent steps. Inorder to maximize preservation of original audio range, a fixedcoefficient for the entire audio file normalization is used. Thecoefficient is obtained using input file analysis to “stretch” the inputaudio range over the maximum available range.

Step 1506 converts sample rate of audio files to selected samplingfrequency. The original audio clips may have various sampling rates(44100 Hz, 48000 Hz, 22050 Hz, 11025 Hz, etc.). After analysis ofavailable hardware capabilities an optimal sampling frequency, whichprovides the most adequate audio quality, is selected. Increasing thesampling frequency above the optimal sampling frequency would notsignificantly increase the perceptual audio quality, due to thelimitations of the audio output subsystem of the accessory unit. Forexample, for the cellular phone system of FIG. 2, after analysis ofavailable hardware capabilities and a series of perceptual tests, the22050 Hz sampling frequency was selected as providing the most adequateaudio quality since the audio output subsystem of the accessory unit hasa relatively sharp drop in response for frequencies above 10-12 kHz.

In order to avoid aliasing effects when changing from higher to lowersampling rate, a low-pass pre-filtering with a cutoff slightly lowerthan the new Nyquist frequency is applied before down sampling. Forrates that are not multiples of each other, cascaded sampling rateconversion schemes are constructed to minimize memory consumption andimprove performance.

The step 1508 of pre-emphasis filtering, along with the re-sampling ofthe previous stage, takes into account the specifics of the audio outputsubsystem of the accessory unit, to achieve improvement of theperceptual audio quality, and to reduce the resulting audio size aftercompression.

Since the speaker of the audio output subsystem of accessory unit ispreferably very small, the resulting sound has very low power in the lowfrequency range. Therefore, providing output in the low frequency rangeis likely to be futile, as it would only increase the size of audio filewithout any perceptual improvements. Additionally, providing output inthe low frequency range may create undesirable “overflow” effects forthe speaker.

For example, for the cellular phone system of FIG. 2, all frequencycontent below about 400 Hz is removed from the audio. In order to makethe audio more “perceptually rich” in the low-frequency range,frequencies around 600 Hz are increased by about +6 dB. The frequencyrange from 1200 Hz to 8200 Hz is kept unchanged. Then, starting fromabout 8200 Hz the signal power is gradually increased, up to +15 dB atthe highest frequency (11 kHz). This compensates for the drop in speakertransfer function at high frequencies and improves the listeningexperience.

A set of subjective audio perceptual tests with various types of audiocontents, using the wide spectrum of hardware of the target platform hasproved that the above-described pre-emphasis significantly improves theperceptual quality of resulting audio. At the same time, reducingfrequency contents in the “non-significant” frequency regions allowsreduction of the resulting compressed audio size, since the data bitsare not allocated to non-used frequencies.

The described pre-emphasis procedure is implemented by a filtering witha FIR (Finite Impulse Response) filter, according to the formula:

$y_{k} = {\sum\limits_{i = 0}^{N - 1}\; {b_{i} \cdot x_{k - i}}}$

where b_(i) are filter coefficients,x_(k) is the k-th output audio sample,Yk is the k-th output audio sample.

The b₁ coefficients are fixed and computed for the particular samplingrate and the desired pre-emphasis response curve. The filter can bedesigned to have a linear phase response (this is actually guaranteed ifthe b.sub.1 coefficients are symmetric), which would ensure absence ofphase distortions to the audio. Since the delay introduced by the filteris not harmful for off-line processing, the filter size can be maderather large to approximate the desired response curve with a highprecision.

After completing the step of pre-emphasis filtering, normalization ofthe sample amplitude is once again performed. Since the filteringsignificantly changes the signal, the second amplitude normalizationstep 1510 is required to convert resulting audio “loudness” to somepre-defined value.

Proceeding to step 1512, the processed audio clip is compressed into anMPEG Layer 3 bit stream. The resulting bit rate (level of compression)can be varied to suit different needs. For instance, it can be madedependent on the source audio clip length, to make the compressed filefit into a pre-defined size. Alternatively, it can be made dependent onthe anticipated delivery method (to create, for instance, a “built-in”audio clip of a very good quality, or to make the audio clip of a verysmall size, for delivery through a slow channel). The compressionparameters can also be selected so that the clip delivery time is aconstant independent of the actual link transfer rate.

Technical Description of a Preferred Embodiment for a Cellular PhoneAccessory Unit Electrical Schematics

FIGS. 6 A-D illustrate the electrical schematics of a mobile phoneaccessory unit. (The image of the printed circuit board, as rendered byComputer Aid Design Software is shown in FIG. 7). Initial boot-up of theprocessor is done from the EEPROM (Electrically Erasable ProgrammableRead Only Memory) using passive serial SPI (Serial ProgrammingInterface) protocol. Thereafter, the boot loader code, read from theEEPROM, loads the main firmware from the Flash memory. The PIL (PhaseLocked Loop) of the processor is programmed for 5× multiplication ofclock frequency.

Firmware debugging is carried out through the JTAG (Joint Test ActionGroup) port using standard TI (Texas Instruments) software. External RAMis mapped both to the program and data space at the same addresses andoccupies all lower address space (64 k). Flash memory (Serial DataFlash) is accessed using software emulation of SPI protocol.

Audio code (Coder-decoder) works at approx. 22 kHz sampling rate (bothchannels). The Mode Control transistor selects the phone interface mode:either RS-232 control mode (closed state) or “hands free” mode (openedstate). The phone itself does not support simultaneous usage of thesetwo modes.

Both channels of RS-232 work at 11,5200 baud rate. From the PC side CTS(Clear to Send) and DTR (Data Terminal Ready) signals are supported.From the phone side CTS and RTS (Request to Send) signals are supported,with inverted polarities. Both channels employ hardware flow control.

The analogue signal level at the phone input is about 100 mV RMS (RootMean Square). The level at the phone output is about 600 mV RMS.

Cellular Phone with Accessory Unit

A cellular phone 900 with an accessory unit 902, according to oneembodiment, is shown in FIGS. 8 and 9 using the Ericsson R520 as anexample. The accessory unit is housed in thin cover 904 (see FIGS. 10and 11) providing a mounting body 906 for attaching the phone 900 to theunit, via an interface connector 800. The accessory unit contains theprinted circuit board 802 and speaker, preferably along the thin portionof the body.

The accessory unit is attached to the back of the phone using the phoneconnector 800. A snap mounting which utilizes a dimpled section on thephone case typically intended for a car phone holder may also be used.

The accessory unit includes the seIVer software and two voice modems,attached to the seIVer. The mounting body contains all necessaryelectronic components.

The tasks of the electronic components include playing back of apre-loaded sound clip upon a caller ID notification reception anddownloading new clips from the seIVer.

The body of the accessory unit preferably comprises a base 804, a thincover 806, and a molded cover 808. All three body components arepreferably made of high-quality aluminum-magnesium-copper alloy(duralumin) and are chemically covered with a protective oxide filmusing two different dyes (colored and black) for the two copies of thedevice. A dense rubber casing may also be used. Factors considered inselecting the body material include lightness (so that the accessoryunit would not exceed the phone itself in weight), mechanical strength,and the quality of electromagnetic shielding properties for protectingthe internal components from the waves radiated by the phone.

According to one embodiment, the accessory unit that embodies thedelivery system for a cellular phone attachment is about1.5″.times.1.5″.times.0.25″ and includes a small high fidelity built-inspeaker. The accessory unit may connect into the AC adapter fixture inthe bottom of a cellular phone. A dense rubber casing or glove may housethe device to protect it. The inside of the rubber glove may have amolded cavity that the device will fit into. The glove may have a circleof small holes which line up with the device's speaker to allow fullsound penetration. The device and glove may be sold in different designvariations both for marketing purposes and for fitting the differentcellular phones on the market.

Additionally, the accessory unit may connect into a cable connectorinstead of an AC adapter jack so that different jacks can be used. Thedevice may also be modified with four or five variations to fit thevarious cellular phone software systems, (not AC adapter variations)currently on the market. The device, including a speaker, may also bemade with several different adapter applications that would attach to avariety of different phonics.

A snap-on mount for fastening the accessory unit to the phone may belocated in the base body. The main purpose of the thin cover is toprovide the electromagnetic shielding. The molded cover 808 containsconnectors and some other components.

A simple snap mechanism for attaching the accessory unit to the phone,similar to the installation of a cellular phone to a car phone holder,may be provided.

FIG. 20 shows the accessory unit by itself, detached from the phone. Theaccessory unit can be detached from the phone similar to the detachmentof other accessories such as the phone charger (usually by applying arotating force rather than pulling straight out).

FIG. 21 shows the accessory unit uncovered. It is preferable to leavethe body of the accessory unit closed. Preferably, there are no glued,soldered, or other permanent junctions inside, however, the highprecision in the manufacture of some components could lead to theirdegradation after repeated assembly and disassembly.

The accessory-to-phone mounting is preferably designed to withstandrepeated attachment and detachment without degradation of the snap-onmounting or connector. Although the phone body is also durable, it ispreferable to attach and detach the accessory by shifting the snap-onlock upward manually during the attachment procedure (like to duringdetachment) to reduce wear of phone body near the latch.

The PCB (printed circuit board), located inside the accessory unit, is amulti-layer board which may have 0.2 mm gaps, two solder mask layers,and a silkscreen layer. The board preferably carries all the components,as illustrated in the schematics, excluding connectors and the speaker.Two outer layers of the board are signal layers; two internal layers areground and 3.3V power plane. For convenience of the PCB assembly onmodem plants, most packages are surface-mounted but not BGA. The boardpreferably does not contain any components requiring rare or custom-madeequipment for their assembly.

The phone connector is preferably selected to maximize the firmness ofthe attachment, taking into account significant dimensions of theaccessory unit. It should be mentioned that the connectors are unique tothe type of the phone used (Ericsson R520 and compatible, like R320 and1'28, in this example).

Factors in selection of the speaker for music playback included soundquality, which is primarily related to the speaker size, compactness,and weight of the speaker, as it is desirable that the speaker not bethicker and heavier than the phone itself. Depending on the availabletechnology, there may be some tradeoff between good speaker quality andhaving a lightweight speaker. Speakers used in professional radioreceivers-scanners may be a reasonable compromise since such speakersprovide better than usual sound quality while possessing reasonabledimensions and weight. Other options include either sacrificing weightand dimensions to increase sound quality or using the new so-called“ceramic” speakers that are now appearing on the market. Mention shouldbe made that although using these speakers could provide better quality,special modifications to the device would be required since thesespeakers could not be directly substituted in place of the standardones.

It should also be noted that the bandwidth of the acoustic channel ofthe cellular phone which, in turn, is non-linearly compressed andtransmitted over a digital channel of the phone, is much less than thebandwidth of the conventional landline phone and can deliver about 150bytes per seconds data transfer rate. Conventional landline phone coulddeliver about 3700 bytes per seconds (V.34). Using better speakers inthe phone would entail loading sound fragments of better quality (and,therefore, of bigger size), which would increase the time necessary todownload a melody. The problem could be solved by using the GSM digitaldata channel directly which would provide a rate of about 1000 bytes persecond for existing cellular networks and more than 7000 bytes persecond for newly deployed systems. Alternative solutions include: havingto tolerate an increase in the sound file or sound clip download time,downloading a melody from a local computer (the melody being deliveredto the local computer by some alternate means), and redesigning thesystem to support conventional (landline) phones. In the latter case,due to the significant increase in the device body size, it may makemore sense to use a stereo-effect (which is reasonable when the speakersof left and right channels have enough spatial separation).

Server Software Description

The server described herein performs the following functions: 1)startup, detection of the modem, detection of the melodies available; 2)answering incoming calls; and 3) servicing requests of user via D1MFcodes.

Upon startup, the application requests the user to select which deviceto work with. Possible options include local test mode (0), modem onCOMI port (1), and modem on COM2 port (2).

If the local test mode is selected, all sounds will be played back usingthe sound card of the local computer and the computer keyboard will beused to control the server (via numeric buttons instead of D1MFkeyboard). This mode is primarily for system testing purposes.

If one of the modems is selected, all sounds will be played back intothe phone line using the selected modem, and the calling party's phonekeyboard (D1MF tones) will be used to control the server. This is thenormal mode of server operation.

The answering of incoming calls is performed differently in the localand the normal modes. In the local mode, the application waits for the‘R’ key to be pressed to simulate remote party RING, while in the normalmode, the application waits for the RING signal from the modem. Then, ineither mode, the application initializes the device used (sound card ormodem). In the latter case, the modem goes “off-hook” and plays back thegreeting message and the main menu (e.g., O-End of the session,I-Current, 3-Next, 4-Load).

Thereafter, the application goes into calling party servicing loop. Exitfrom the loop is possible upon reception of D1MF code ‘0’ (or itssimulation using the keyboard) or after the 30-seconds timeout if noreaction is detected from the remote user. Additionally, if working withthe modem, the loop is exited when short beeps (“BUSY”) condition isdetected on the phone line. In the local mode, the ‘X’ key also leads tothe immediate exit of the application.

The calling party servicing algorithm may work as follows: the softwarekeeps the internal counter or number of the current sound clip.Initially, this number is “0” indicating that the clip is at the top ofthe list. Upon receiving the “1” command, the software plays back theclip with the current number. Upon receiving the “2” command, thesoftware increases the number and plays back the melody, i.e., plays thenext melody. Upon receiving the “3” command, the software decreases thenumber and plays back the melody, i.e., plays the previous melody. Uponreceiving the “4” command, the melody download is simulated. For theobvious reason, this mode is not implemented yet. Upon receiving the “0”command, the application switches the modem “on-hook” releasing thephone line and returns to the incoming call waiting state. Uponencountering any other command, the application plays back a standarderror message. At any moment, the server application can be aborted bypressing <Ctr1>C combination on the keyboard.

The server application keeps a log file (e.g., named“ProgramName_N.Log”) where N is a port number. Therefore, if twoinstances of the application are started, one for the modem on COMI andthe other for the modem on COM2, two independent log files will becreated. The log file contains brief information about user and serveractions, times of events, their main features, for example, state of themodem or the sound card. These files are intended to be sent to thesoftware developers in case of problems but can be used for otherpurposes as well, for example, to estimate the server load.

Due to the fact that the server application always plays a melody withthe same quality as one would be able to hear through the conventionalphone channel [monophonic, 8 kHz-sampling rate (signal bandwidth up to3.7 kHz)], the sound quality of the played back clips may be low. Thisis not related in any way with the quality of sound that would bedigitally transferred to the client's phone when the melody is selectedsince listening to the clips from server through the phone network couldnot deliver better quality than the phone channel itself. For thisreason, sound files compressed in monophonic versus stereo form would bepreferred since the rate of delivery would be faster, with no loss inplayback quality from the phone. At the same time, when the clips aredownloaded into the phone in digital form, significantly better qualitycould be delivered upon playback due to the perceptual compression;however, this would increase the transfer time.

The server software could also be implemented to track which clips weresent to which user or subscriber. This information could then be trackedand reported to different third parties such as the Copyright Office, orperforming or artist's rights organizations or societies.

Devices for Accessing Sound and Image Files

Electronic devices adapted to receive sound and image data, according tothe present invention, may be provided with an attachment or built inmechanism for providing consumers with Internet based or Internet freeaccess to a library of downloadable sound and/or image files. Consumersmay be allowed to download free clips of a song, musical composition, orother sound recording or movie or other performance onto any of thesedevices for use as alerts.

After hearing or viewing a clip, the user, preferably by the push of abutton, may transact a purchase of the full file associated with theclip, which may be downloaded to the device in its entirety, ordelivered to the user's address on an independent medium such asrecords, cassette tapes, CDs, videotapes, and DVDs. Such practice isintended to encourage the sales of sound and image files by giving theuser the opportunity to quickly make an impulse purchase.

A device for downloading and listening to music files, which is similarto a walkman type I-Pod™ device, but uses the same delivery method asdescribed for the cellular phone comprises a speaker and/or an earphoneset for listening to music with volume controls (such as Bose or ShureE5 universal earphones), and a server access element (which may beapproximately the size of a credit card). Such a device may be used as ahand held portable music player, as well as a car radio or home system,and may include larger speakers for use as an audio system by businessessuch as bars, restaurants and clubs.

In addition to features which allow a user to access the server library,the device may include other features common to conventional MP3 playersand/or Apple I-Pod™ devices. The server access element includescontrols, which may be buttons, for accessing, browsing, and downloadingfiles from the server to the device. Speed dial technology may be usedfor accessing the server. For browsing, a multi-task arrows button whichallows the user to browse, listen to samples, and highlight specificselections may be provided.

The server access element may include a small LCD monitor (approximately1″×1.75″) for text browsing the server library. A small microphone holemay also be included for allowing the user to browse the library usingvoice commands. The earphone set or speaker will enable the user listento downloaded sound files.

Downloaded files may be stored on a device storage list for accessing atall times, or deleted. Thus, the user may access a library containing alarge number of sound files, and browse, download, and listen to music,without the Internet or the need to plug into a computer. The consumermay be charged a fee for each download, or may be able to purchaseactual items, for delivery to an address indicated by the user, such asrecords, cassette tapes and CD's through the access element. Free clipswhich the user can download may induce the consumer into purchasing theentire sound file from where the clip originated.

The device may also include a mechanism for allowing a user to storedownloaded files on a medium, such as a card, independent of the device.To this end, the device may provide a slot into which a storage card maybe inserted, such that when the device is full, files may be downloadedonto the card for emptying space on the device. A security mechanism mayalso be included to prevent intellectual property abuse, for example, bypreventing users from playing copied files on other devices as describedabove. Such devices may further include a monitoring feature, whichwould allow performing rights organizations such as ASCAP and BMI tokeep track of music publicly played by business such as bars,restaurants, and clubs for the purpose of paying out royalties.

A schematic diagram for a media file monitoring system 1800, accordingto a preferred embodiment, is shown in FIG. 18, for use with an I-Pod™type listening device 1802, wherein a consumer may purchase copyrightregistered media files which are downloaded wirelessly to the device1802. The system 1800 includes an existing wireless network 1804 of 1.5Gor more, a system monitoring server 1806, and a system content server1808. The monitoring server 1806 monitors and counts every filedelivered to the consumer device 1802, for monitoring and counting everyfile delivered to the device 1802. The server 1806 may track eachindividually titled file which may include information such as songtitle and artist name, purchase price, the consumer's name, and otheridentity information, time of delivery, and any other pertinentinformation. The server 1806 may also protect encrypted copyrightedfiles from illegal file copying. The content server 1808 storescopyrighted digital media content licensed from multiple entertainmentcompanies. Thereafter, monitoring information, including statistics maybe transmitted (e.g. through the Internet) to a company or organization.The system described may also apply to a viewer device for monitoringimage files.

A portable laptop type viewer device, for accessing and viewing imageand/or sound files, may comprise a wireless earphone set and/or speakerfor listening to programs with volume controls, and a Personal DigitalAssistant (PDA) with a monitor which may be approximately the size of alaptop computer. This device allows the user to access a server librarycontaining a large number of movies, TV shows, cartoons, and otherfiles, using either text or voice activation, without the need to pluginto a computer or use of the Internet or other computer based wirelesstelecommunication system.

The files may be categorized and subcategorized by type of file (i.e.movies, TV shows) then by title or name of main actors. TV shows mayfurther be classified by providing a description for each episode,similar to a description provided in a TV Guide™. Other categories andsubcategories of classification may be provided to allow the user toidentify the exact file he wishes to access.

The device allows the user to browse, download, preview, store and viewhis selections, (using text, voice, or button commands), wherein a feemay be charged by the provider for any or all of these options. Thefiles may be made available as clips as well as in their entirety. Theviewer device may include a folder containing previously downloadedimage files that can be accessed at any time and deleted when desired.The library may be organized by categories such as type of show (i.e.movies, TV sitcoms), names of actors, show titles, sitcom description(e.g. as appearing in TV Guide™) etc.

Method of Advertising Using Delivery of Sound Clips

The method of delivering sound and image files, in accordance with thepresent invention, can further be utilized as an advertising tool. Tothis end, any of the above-described systems carrying the library ofsound and/or image files, which include a website and non-Internetaccessible holding unit, may be used to expose the user to sponsoredadvertising messages. For example, a user calling the holding unit mayhear advertising while the system is accessing the library.

Other advertising opportunities may be provided by utilizing a phone orother electronic device using alerts according to the present invention.For example, the phone may ring with advertising gimmicks such aspromotional messages. Such advertisement gimmicks may be played asdefault rings when no clip is selected for the ring. Additionally, aphone may be programmed to play, or transmit advertisementsspontaneously. Clips containing advertising messages such as jingles mayalso be provided. Advertising messages may be tacked onto a userselected clip of a popular song or the like.

Method of Distributing Music and Audiovisual Works to Consumers.

A method of distributing music and audiovisual works to consumers whileaccounting to copyright owners of the works comprises: (a) Makingavailable on a website various selections of works in various categoriesfor review by identifying information and offering a portion of the workfor hearing or listening, each work being coded internally withidentification to a copyright owner or its representative; (b) Allowingconsumers to select the viewable or listenable portion of the work fordata storage online or for downloading to the consumers' electronicdevices at home wherein the downloaded file being encrypted to only playon the consumer's electronic devices first receiving the download; (c)Optionally tracking those consumers who received the download of theportion of the work and reporting to the copyright owners or theirrepresentatives information concerning the download; (d) Allowing aconsumer to return to the website to purchase and download a completecopy of the copyrighted work previously sampled by the consumer; (e)Conducting an online purchasing transaction and charging the consumerfor the download; (f) Downloading a complete copy of the copyrightedwork to the consumer in an encrypted fashion so as to be playable onlyin the consumer's electronic device and not exchangeable with thirdparties; (g) Tracking those consumers who received the download of thecopyrighted work and reporting to the copyright owner of theirrepresentatives information concerning the download; and (h) Paying thecopyright owners or their representatives a portion of the moneyreceived from the consumers for their downloading of the copyrightedwork.

The delivery system, according to the present invention will alsointegrate with future wireless technology, such as 3-G systems, as itbecomes available, for offering enhanced capabilities for accessing,delivering, and using sound and image files.

While the present invention has been described with reference to apreferred embodiment or to particular embodiments, it will be understoodthat various changes and additional variations may be made andequivalents may be substituted for elements thereof without departingfrom the scope of the invention or the inventive concept thereof. Inaddition, many modifications may be made to adapt a particular situationor material to the teachings of the invention without departing from theessential scope thereof. Therefore, it is intended that the inventionnot be limited to particular embodiments disclosed herein for carryingit out, but that the invention includes all embodiments falling withinthe scope of the appended claims.

INDUSTRIAL APPLICABILITY

It is an object of the present invention to provide a unique method forusing sound and image clips as alert sounds for a variety of electronicdevices.

It is a further object of the present invention to provide a method forringing a cellular telephone using actual sound files including soundclips which may comprise real music with human voice, various instrumentsounds and other sound effects.

It is a further object of the present invention to provide a softwaresystem which may be integrated into existing cellular telephone hardwarefor enabling the cellular telephone to access and utilize sound filesincluding clips, without the need for extra hardware.

It is a further object of the present invention to provide an accessoryattachment for cellular telephones and for landline telephones whichwill enable the telephone to access and utilize sound files, includingclips.

It is a further object of the present invention to provide a securityfeature for devices capable of receiving and playing multi-media filesfor preventing consumer unauthorized dissemination of such files.

It is a further object of the present invention to provide a trackingfeature for devices capable of receiving and playing music files forproviding performers and writers rights organizations with an accuratemethod of determining royalty right payments to registered performersand writers.

It is a further object of the present invention to develop a website forbrowsing and for delivery of sound and image files including clips.

It is a further object of the present invention to provide a method forselling and buying products associated with existing copyrighted music,movies, TV shows, and other recorded performances.

It is a further object of the present invention to provide a deliverymethod for allowing a user to access, browse and download files that isindependent of the Internet, and does not require a plug in or handwired connection.

These and other objects, advantages, and the industrial utility of thepresent invention will be apparent from a review of the accompanyingspecification and drawings.

What is claimed is:
 1. A method of wirelessly delivering at least one compressed digital audio and/or visual file to a wireless device means, the method comprising: providing a website; wherein the at least one compressed digital audio and/or visual file comprises at least one of a full, partial, or segment of: a song, a musical score, a musical composition, a ringtone, a video or video segment, a movie or movie segment, a film or film segment, an image clip, a picture, a clip, an image, a photograph, a television show, a human voice recording, a personal recording, a cartoon, an animation, an audio advertisement, a visual advertisement, or combinations thereof; wherein the website is associated with the at least one compressed digital audio and/or visual file; wherein the website includes a plurality of visual images; wherein each visual image is associated with at least one compressed digital audio and/or visual file for selection of at least one compressed digital audio and/or visual file; receiving a first request from the wireless device means for the website; transmitting the website to the wireless device means based on the received first request; receiving a second request from the wireless device means selecting the at least one compressed digital audio and/or visual file based on the visual image; and streaming the at least one compressed digital audio and/or visual file to the wireless device means based on the received second request using orthogonal frequency-division multiplex (OFDM) modulation over a cellular data connection.
 2. The method of claim 1, wherein the wireless device means is a cell phone.
 3. The method of claim 2, further comprising storing at least a portion of the at least one compressed digital audio and/or visual file on the cell phone.
 4. The method of claim 1, further comprising storing at least a portion of the at least one compressed digital audio and/or visual file on the wireless device means.
 5. The method of claim 1, further comprising optimizing the digital audio and/or visual file according to an optimization scheme.
 6. The method of claim 5, wherein optimizing the digital audio and/or visual file according to an optimization scheme comprises: converting the digital audio and/or visual file to mono by summing of a left channel and a right channel if the digital audio and/or visual file is stereo and the wireless device means is configured only for mono playback; normalizing an amplitude of the digital audio and/or visual file; converting the digital audio and/or visual file according to a selected sampling frequency; performing a pre-emphasis filtering on the digital audio and/or visual file; and re-normalizing the digital audio and/or visual file.
 7. A method of wirelessly delivering a compressed digital audio and/or visual file to a wireless device means, the method comprising: providing a website; wherein the compressed digital audio and/or visual file comprises at least one of a full, partial, or segment of: a song, a musical score, a musical composition, a ringtone, a video or video segment, a movie or movie segment, a film or film segment, an image clip, a picture, a clip, an image, a photograph, a television show, a human voice recording, a personal recording, a cartoon, an animation, an audio advertisement, a visual advertisement, or combinations thereof; wherein the website includes a visual image associated with the compressed digital audio and/or visual file for selection of the compressed digital audio and/or visual file; receiving a first request from the wireless device means for the website; transmitting the website to the wireless device means based on the received first request; receiving a second request from the wireless device means selecting the compressed digital audio and/or visual file from a menu; and transmitting the compressed digital audio and/or visual file to the wireless device means using orthogonal frequency-division multiplex (OFDM) modulation over a cellular data connection based on the received second request.
 8. The method of claim 7, wherein the wireless device means is a cell phone.
 9. The method of claim 8, further comprising storing at least a portion of the compressed digital audio and/or visual file on the cell phone.
 10. The method of claim 7, further comprising storing at least a portion of the compressed digital audio and/or visual file on the wireless device means.
 11. The method of claim 7, further comprising optimizing the digital audio and/or visual file according to an optimization scheme.
 12. The method of claim 11, wherein optimizing the digital audio and/or visual file according to an optimization scheme comprises: converting the digital audio and/or visual file to mono by summing of a left channel and a right channel if the digital audio and/or visual file is stereo and the wireless device means is configured only for mono playback; normalizing an amplitude of the digital audio and/or visual file; converting the digital audio and/or visual file according to a selected sampling frequency; performing a pre-emphasis filtering on the digital audio and/or visual file; and re-normalizing the digital audio and/or visual file.
 13. A method of wirelessly delivering a compressed digital audio and/or visual file to a wireless device means, the method comprising: providing a website; wherein the compressed digital audio and/or visual file comprises at least one of a full, partial, or segment of: a song, a musical score, a musical composition, a ringtone, a video or video segment, a movie or movie segment, a film or film segment, an image clip, a picture, a clip, an image, a photograph, a television show, a human voice recording, a personal recording, a cartoon, an animation, an audio advertisement, a visual advertisement, or combinations thereof; wherein the website includes a visual image associated with the compressed digital audio and/or visual file for selection of the compressed digital audio and/or visual file; receiving a first request from the wireless device means for the website; transmitting the website to the wireless device means based on the received first request; receiving a second request from the wireless device means selecting the compressed digital audio and/or visual file; and streaming the compressed digital audio and/or visual file to the wireless device means using orthogonal frequency-division multiplex (OFDM) modulation over a cellular data connection based on the received second request.
 14. The method of claim 13, wherein the wireless device means is a cell phone.
 15. The method of claim 14, further comprising storing at least a portion of the compressed digital audio and/or visual file on the cell phone.
 16. The method of claim 13, further comprising storing at least a portion of the compressed digital audio and/or visual file on the wireless device means.
 17. The method of claim 13, further comprising optimizing the digital audio and/or visual file according to an optimization scheme.
 18. The method of claim 17, wherein optimizing the digital audio and/or visual file according to an optimization scheme comprises: converting the digital audio and/or visual file to mono by summing of a left channel and a right channel if the digital audio and/or visual file is stereo and the wireless device means is configured only for mono playback; normalizing an amplitude of the digital audio and/or visual file; converting the digital audio and/or visual file according to a selected sampling frequency; performing a pre-emphasis filtering on the digital audio and/or visual file; and re-normalizing the digital audio and/or visual file. 